1,087 research outputs found

    Bethe Projections for Non-Local Inference

    Full text link
    Many inference problems in structured prediction are naturally solved by augmenting a tractable dependency structure with complex, non-local auxiliary objectives. This includes the mean field family of variational inference algorithms, soft- or hard-constrained inference using Lagrangian relaxation or linear programming, collective graphical models, and forms of semi-supervised learning such as posterior regularization. We present a method to discriminatively learn broad families of inference objectives, capturing powerful non-local statistics of the latent variables, while maintaining tractable and provably fast inference using non-Euclidean projected gradient descent with a distance-generating function given by the Bethe entropy. We demonstrate the performance and flexibility of our method by (1) extracting structured citations from research papers by learning soft global constraints, (2) achieving state-of-the-art results on a widely-used handwriting recognition task using a novel learned non-convex inference procedure, and (3) providing a fast and highly scalable algorithm for the challenging problem of inference in a collective graphical model applied to bird migration.Comment: minor bug fix to appendix. appeared in UAI 201

    Conservation theorems for the Cohesiveness Principle

    Full text link
    We prove that the Cohesiveness Principle (COH) is Π11\Pi^1_1 conservative over RCA0+IΣn0RCA_0 + I\Sigma^0_n and over RCA0+BΣn0RCA_0 + B\Sigma^0_n for all n≥2n \geq 2 by recursion-theoretic means. We first characterize COH over RCA0+BΣ20RCA_0 + B\Sigma^0_2 as a `jumped' version of Weak K\"{o}nig's Lemma (WKL) and develop suitable machinery including a version of the Friedberg jump-inversion theorem. The main theorem is obtained when we combine these with known results about WKL. In an appendix we give a proof of the Π11\Pi^1_1 conservativity of WKL over RCA0RCA_0 by way of the Superlow Basis Theorem and a new proof of a recent jump-inversion theorem of Towsner

    Synthesizing Normalized Faces from Facial Identity Features

    Full text link
    We present a method for synthesizing a frontal, neutral-expression image of a person's face given an input face photograph. This is achieved by learning to generate facial landmarks and textures from features extracted from a facial-recognition network. Unlike previous approaches, our encoding feature vector is largely invariant to lighting, pose, and facial expression. Exploiting this invariance, we train our decoder network using only frontal, neutral-expression photographs. Since these photographs are well aligned, we can decompose them into a sparse set of landmark points and aligned texture maps. The decoder then predicts landmarks and textures independently and combines them using a differentiable image warping operation. The resulting images can be used for a number of applications, such as analyzing facial attributes, exposure and white balance adjustment, or creating a 3-D avatar
    • …
    corecore